Semantic Integration of XML Heterogeneous Data Sources
نویسندگان
چکیده
With the current explosion of data, retrieving and integrating information from various sources is a critical problem. The designer has to specify a mediated schema providing a homogeneous view on the sources. In this paper, we report on an initial work toward automatically generating mappings between elements in the sources and in the mediated schema. Information sources we are interested in are XML documents in respect with a Document Type Definition (DTD). We describe the Xyleme project, which is the context of this study. We present our approach implemented in the SAMAG system to automatically find mappings on the basis of semantic and structural criteria. Finally, we report the first results of an experiment where SAMAG has been applied to XML documents in the cultural domain.
منابع مشابه
Integrating Heterogeneous Data Source Using Ontology
Integrating data from multiple heterogeneous sources entail dealing with different data models, schemas and query languages. The burgeoning Semantic Web has provided several new methods for data integration. This paper focuses on integration of relational database and XML data. To solve the problem we propose an ontologybased approach. A semantic integration infrastructure for heterogeneous dat...
متن کاملOWL based XML Data Integration
Data integration helps in manipulating data transparently across multiple distributed databases. The purpose of integration system is to provide a unified global view to the user over various heterogeneous data sources. To answer user queries, a data integration system employs a set of semantic mapping between global and local schema. Such integration has challenges of creation of local and glo...
متن کاملInformation Sharing for the Semantic Web -a Schema Transformation Approach
This paper proposes a framework for transforming and integrating heterogeneous XML data sources, making use of known correspondences from them to ontologies expressed in the form of RDFS schemas. Our algorithms generate schema transformation/integration rules which are implemented in the AutoMed heterogeneous data integration system. The paper first illustrates how correspondences to a single o...
متن کاملThe Indilib Approach: How to Integrate Heterogeneous SGML/XML Data by Means of the Semantic Web
This Paper introduces an approach to the integration of heterogeneous data that sets a focus on the potential provided by Berners-Lee's Semantic Web. We present a much needed comparative overview that provides a guideline for making a choice of a suitable metadata language concerning the integration of heterogeneous SGML/XML data sources within Digital Libraries. The so-called INdilib guide inv...
متن کاملPeer-to-Peer Semantic Integration of XML and RDF Data Sources
Peer-to-Peer (P2P) data management systems combine traditional schema-based integration techniques with the P2P infrastructure. In this paper, we propose a P2P data management framework named PEPSINT that semantically integrates heterogeneous XML and RDF data sources, using a hybrid architecture and a global-as-view approach. Our focus is on the query processing techniques over heterogeneous da...
متن کامل